Principal Component Analysis of Interval Data: a Symbolic Data Analysis Approach
نویسندگان
چکیده
The present paper deals with the study of continuous interval data by means of suitable Principal Component Analyses (PCA). Statistical units described by interval data can be assumed as special cases of Symbolic Objects (SO) (Diday, 1987). In Symbolic Data Analysis (SDA), these data are represented as hypercubes. In the present paper, we propose some extensions of the PCA with the aim of representing, in a space of reduced dimensions, images of such hypercubes, pointing out differences and similarities according to their structural features.
منابع مشابه
Outlier Detection in Wireless Sensor Networks Using Distributed Principal Component Analysis
Detecting anomalies is an important challenge for intrusion detection and fault diagnosis in wireless sensor networks (WSNs). To address the problem of outlier detection in wireless sensor networks, in this paper we present a PCA-based centralized approach and a DPCA-based distributed energy-efficient approach for detecting outliers in sensed data in a WSN. The outliers in sensed data can be ca...
متن کاملDevelopment of a cell formation heuristic by considering realistic data using principal component analysis and Taguchi’s method
Over the last four decades of research, numerous cell formation algorithms have been developed and tested, still this research remains of interest to this day. Appropriate manufacturing cells formation is the first step in designing a cellular manufacturing system. In cellular manufacturing, consideration to manufacturing flexibility and productionrelated data is vital for cell formation....
متن کاملA two-stage principal component analysis of symbolic data using equicorrelated and jointly equicorrelated covariance structures
A new approach to derive the principal components of symbolic data is proposed in this article. This is done in two stages: first getting eigenblocks and eigenmatrices of the variancecovariance matrix, and then analyzing these eigenblocks and the corresponding principal vectors together in some seemly sense to get the adjusted eigenvalues and the corresponding eigenvectors of the interval data....
متن کاملSymbolic Covariance Matrix for Interval-valued Variables and its Application to Principal Component Analysis: a Case Study
In the last two decades, principal component analysis (PCA) was extended to interval-valued data; several adaptations of the classical approach are known from the literature. Our approach is based on the symbolic covariance matrix Cov for the interval-valued variables proposed by Billard (2008). Its crucial advantage, when compared to other approaches, is that it fully utilizes all the informat...
متن کاملDetection of Monotonic Chain Structures in Mixed Feature Type Multidimensional Data
Symbolic data analysis aims at generalizing some standard statistical methods. Generalization of principal component analysis (PCA) is an interesting and important research problem in symbolic data analysis. A main purpose of PCA is to find a linear structure in multidimensional data. However, a direct extension of PCA is difficult, when each object is described by not only usual quantitative f...
متن کامل